S4Net: Single Stage Salient-Instance Segmentation
Authors
Abstract
In this paper, we consider an interesting vision problem: salient instance segmentation. In addition to producing approximate bounding boxes, our network also outputs high-quality instance-level segments. Taking into account the category-independent property of each target, we design a single-stage salient instance segmentation framework with a novel segmentation branch. Our new branch considers not only the local context inside each detection window but also its surrounding context, enabling us to distinguish instances in the same scope even under occlusion. Our network is end-to-end trainable and runs fast (40 fps when processing an image with resolution 320 × 320). We evaluate our approach on a publicly available benchmark and show that it outperforms alternative solutions. In addition, we provide a thorough analysis of the design choices to help readers better understand the function of each part of our network. To facilitate the development of this area, our code will be available at https://github.com/RuochenFan/S4Net.
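The abstract does not say how the surrounding context of a detection window is gathered. As a rough illustration only, and not the authors' method, one common way to picture it is to enlarge each box about its center and clip it to the image. The function names `expand_box` and `frame_budget_ms` and the `scale` parameter are hypothetical. The second helper simply converts the quoted 40 fps into a per-frame time budget.

```python
def expand_box(box, scale, image_width, image_height):
    """Enlarge an (x1, y1, x2, y2) box about its center, then clip to the image.

    Assumption for illustration: 'surrounding context' is modeled as a
    uniformly scaled window. The abstract does not confirm this choice.
    """
    x1, y1, x2, y2 = box
    cx, cy = (x1 + x2) / 2.0, (y1 + y2) / 2.0
    half_w = (x2 - x1) * scale / 2.0
    half_h = (y2 - y1) * scale / 2.0
    return (
        max(0.0, cx - half_w),
        max(0.0, cy - half_h),
        min(float(image_width), cx + half_w),
        min(float(image_height), cy + half_h),
    )


def frame_budget_ms(fps):
    """Time available per frame, in milliseconds, at a given frame rate."""
    return 1000.0 / fps


if __name__ == "__main__":
    # A 100 x 100 window inside a 320 x 320 image, enlarged by 1.5x.
    print(expand_box((100, 100, 200, 200), 1.5, 320, 320))
    # Per-frame budget at the 40 fps quoted in the abstract.
    print(frame_budget_ms(40))
```

At 40 fps the network has 25 ms per 320 × 320 frame. A window near the image border is clipped rather than padded in this sketch, so its context region is smaller on that side.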
Similar resources
Meaningful segmentation of 3D object models based on extraction of salient parts and the object core
3D model segmentation plays an important role in 3D model processing applications such as retrieval, compression, and watermarking. In this paper, a new 3D model segmentation algorithm is proposed. Cognitive science research identifies 3D object decomposition as a way humans analyze and detect objects. There are two general types of segments which are obtained from decomposition based on thi...
Hierarchical image simplification and segmentation based on Mumford-Shah-salient level line selection
Hierarchies, such as the tree of shapes, are popular representations for image simplification and segmentation thanks to their multiscale structures. Selecting meaningful level lines (boundaries of shapes) simplifies the image while keeping salient structures intact. Many image simplification and segmentation methods are driven by the optimization of an energy functional, for instance th...
Reasoning about Object Instances, Relations and Extents in RGBD Scenes
The vast majority of literature in scene parsing can be described as semantic pixel labeling or semantic segmentation: predicting the semantic class of the object represented by each pixel in the scene. Our familiar perception of the world, however, provides a far richer representation. Firstly, rather than just being able to predict the semantic class of a location in a scene, humans are able ...
Robust Feature Detection for 3D Object Recognition and Matching
Salient surface features play a central role in tasks related to 3D object recognition and matching. There is a large body of psychophysical evidence demonstrating the perceptual significance of surface features such as local minima of principal curvatures in the decomposition of objects into a hierarchy of parts. Many recognition strategies employed in machine vision also directly use features ...
New benchmark for image segmentation evaluation
Abstract. Image segmentation and its performance evaluation are very difficult but important problems in computer vision. A major challenge in segmentation evaluation comes from the fundamental conflict between generality and objectivity: For general-purpose segmentation, the ground truth and segmentation accuracy may not be well defined, while embedding the evaluation in a specific application, the e...
Journal title:
- CoRR
Volume: abs/1711.07618, Issue: -
Pages: -
Publication date: 2017